Perception and Comprehension of Synthetic Speech

نویسندگان

  • Stephen J. Winters
  • David B. Pisoni
چکیده

An extensive body of research on the perception of synthetic speech carried out over the past 30 years has established that listeners have much more difficulty perceiving synthetic speech than natural speech. Differences in perceptual processing have been found in a variety of behavioral tasks, including assessments of segmental intelligibility, word recall, lexical decision, sentence transcription, and comprehension of spoken passages of connected text. Alternative groups of listeners—such as non-native speakers of English, children and older adults—have even more difficulty perceiving synthetic speech than young, healthy, college-aged listeners typically tested in perception studies. It has also been shown, however, that the ability to perceive synthetic speech improves rapidly with training and experience. Incorporating appropriate prosodic contours into synthetic speech algorithms—along with providing listeners with higherlevel contextual information—can also aid the perception of synthetic speech. Listener difficulty in processing synthetic speech has been attributed to the impoverished acousticphonetic segmental cues—and inherent lack of natural variability and acoustic-phonetic redundancy—in synthetic speech produced by rule. The perceptual difficulties that listeners have in perceiving speech which lacks acoustic-phonetic variability has been cited as evidence for the importance of variability to the perception of natural speech. Future research on the perception of synthetic speech will need to investigate the sources of acoustic-phonetic variability and redundancy that improve the perception of synthetic speech, as well as determine the efficacy of synthetically produced audio-visual speech, and the extent to which the impoverished acoustic-phonetic structure of synthetic speech impacts higher-level comprehension processes. New behavioral methods of assessing the perception of speech by human listeners will need to be developed in order for our understanding of synthetic speech perception to keep pace with the rapid progress of speech synthesis technology.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Examining the Association between T-unit and Pausing Length on the EFL Perception of Listening Comprehension

Listening taking over half of the learners’ time and effort (Nunan, 1998), forms a basis for acquiring much of a language. There are factors affecting listening comprehension and its perception, such as the speech rate, phonological properties of the text, the quality of the recording, the learners’ anxiety, and listening comprehension strategies (Goh, 2000; Hamouda, 2013). At the Iran Language...

متن کامل

Audible Hyperlinks in Synthetic Speech: Effects of Speech and Non-speech Cues on Hyperlink Perception & Sentence Comprehension

This paper describes two empirical experiments investigating the perception of embedded audible hyperlinks, designed using speech and non-speech cues, and their effect on the comprehension of synthetic speech. Results from the first experiment showed high accuracy levels of hyperlink perception and differences in comprehension performance between sentences with hyperlinks and sentences without ...

متن کامل

Perception of Synthetic Speech

This chapter sununarizes the results we obtained over the last 15 years at Indiana University on the perception of synthetic speech produced by rule. A wide variety of behavioral studies have been carried out on phoneme intelligibil­ ity, word recognition, and comprehension to learn more about how human listeners perceive and understand synthetic speech. Some of this research, particularly the ...

متن کامل

CUEING HYPERLINKS IN AUDITORY INTERFACES Research paper for the ICAD05 workshop "Combining Speech and Sound in the User Interface"

This paper describes two empirical experiments investigating the perception of embedded audible hyperlinks, designed using speech and non-speech cues, and their effect on the comprehension of synthetic speech. Results from the first experiment showed high accuracy levels of hyperlink perception and differences in comprehension performance between sentences with hyperlinks and sentences without ...

متن کامل

CUEING HYPERLINKS IN AUDITORY DISPLAYS Research paper for the ICAD05 workshop "Combining Speech and Sound in the User Interface"

This paper describes two empirical experiments investigating the perception of embedded audible hyperlinks, designed using speech and non-speech cues, and their effect on the comprehension of synthetic speech. Results from the first experiment showed high accuracy levels of hyperlink perception and differences in comprehension performance between sentences with hyperlinks and sentences without ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004